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ABSTRACT 

Information-driven networks include a large category of net- 
working systems, where network nodes are aware of infor- 
mation delivered and thus can not only forward data packets 
but may also perform information processing. In many sit- 
uations, the quality of service (QoS) in information-driven 
networks is provisioned with the redundancy in information. 
Traditional performance models generally adopt evaluation 
measures suitable for packet-oriented service guarantee, such 
as packet delay, throughput, and packet loss rate. These per- 
formance measures, however, do not align well with the ac- 
tual need of information-driven networks. New performance 
measures and models for information-driven networks, de- 
spite their importance, have been mainly blank, largely be- 
cause information processing is clearly application depen- 
dent and cannot be easily captured within a generic frame- 
work. To fill the vacancy, we present a new performance 
evaluation framework particularly tailored for information- 
driven networks, based on the recent development of stochas- 
tic network calculus. We analyze the QoS with respect to 
information delivery and study the scheduling problem with 
the new performance metrics. Our analytical framework can 
be used to calculate the network capacity in information de- 
livery and in the meantime to help transmission scheduling 
for a large body of systems where QoS is stochastically guar- 
anteed with the redundancy in information. 

Categories and Subject Descriptors 

C.4 [Performance of Systems]: Modeling techniques; H.l 
[Models and Principles]: Miscellaneous 

General Terms 

Theory, Performance 

Keywords 

Network Calculus, Information-Driven Networks, Performance 
Modeling 
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1. INTRODUCTION 

Although computer networks in general are purposed for 
information delivery, most existing network architectures 
like the Internet are actually not information driven in the 
sense that network nodes (e.g., routers and switchers) only 
care about packets instead of the information inside. As 
a common principle, network nodes as well as the whole 
network system are designed to support quality of service 
(QoS) with respect to packet-oriented service measures such 
as bounded packet delay and promised data throughput. 
To achieve this, QoS provisioning mechanisms ^Sj have been 
proposed and used in the Internet, including admission con- 
trol, congestion control, resource reservation, QoS routing, 
and so on. It has been observed that on the one hand QoS 
provisioning mechanisms provide certain service guarantee 
to privileged data traffic; on the other hand they largely in- 
crease the system complexity and incur a heavy burden on 
network nodes. Many emerging network systems, for exam- 
ple, wireless sensor networks, consist of nodes with only very 
limited computational capability and thus do not have the 
luxury to accommodate complex QoS mechanisms. Nev- 
ertheless, QoS is important in any means. For instance, 
in a patient-monitoring system or a fire alarm system with 
wireless sensor networks, we certainly require important in- 
formation like abnormal heart beats or high temperature 
readings to be delivered correctly to a monitoring center. 
The dilemma we face is to guarantee QoS maybe without 
any underlying promise from network nodes on timely per 
packet delivery. 

The traditional meaning of QoS, e.g., for guaranteed per 
packet delivery and end-to-end delay, is actually an overkill, 
since all we care is information. The traditional solutions fo- 
cusing on packets instead of the information inside have the 
historical reason: the network protocol stack is layered and 
network protocols should not mix up with application-layer 
information. With the emergence of new technologies such 
as wireless sensor networks, however, the layering principle 
is not necessarily a rule of thumb, and the redundancy in 
the information sources should be utilized in network pro- 
tocol design. A network node may not be purely a data 
forwarding device. Instead, it may become aware of infor- 
mation forwarded and is able to perform information pro- 
cessing whenever necessary. The ultimate goal of the whole 
network system is no longer to guarantee service for indi- 
vidual packets, but to guarantee a certain amount of infor- 
mation to be successfully transported. We call this type 
of networking systems mformation-driven networks. Typi- 
cal examples include wireless sensor networks with directed 
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Figure 1: A simple example of wireless sensor net- 
works 



data diffusion [10] . distributed content sharing over peer-to- 
peer networks [U [T5] , and networks using network coding [1] 
[26]. 

Making the network to be information driven opens spe- 
cial opportunities for QoS provisioning, e.g., in environment 
where the network is subject to high packet losses or net- 
work nodes are stringently constrained by computational 
power and limited bandwidth. In applications where data 
exhibit spatial and/or temporal correlation, it is unnecessary 
to provide reliable transmission for each individual packet. 
Instead, QoS is guaranteed as long as required information 
can be obtained as sure (i.e., with a very high probability). 
We use a simple example to illustrate the advantages of al- 
lowing the network to become information aware. 

Example 1. Assume that a wireless sensor network in- 
cludes six sensor nodes and one processing center, also called 
the sink node, as shown in Figure\l\ Four sensors at the bot- 
tom of the figure monitor the environment and periodically 
send out measurement data like temperature, humidity read- 
ings. Two sensors in the middle of the figure are used as data 
relay to the sink. Wireless links are generally subject to a 
high loss rate in wireless sensor networks j23f . so we assume 
that the average packet loss rate is 25% for each wireless 
link. Without considering information, we treat the network 
as purely a data delivery system like the Internet. In this 
case, we need to make sure that each data packet is correctly 
delivered from the source to the sink with a high probability. 
If we set this probability to be no smaller than 96%, we need 
about 24 transmissions in total (calculated with two retrans- 
missions each link to guarantee the high probability of correct 
end-to-end packet delivery). In contrast, if relay nodes know 
that the information from the four source sensors is highly 
correlated and if the information is considered to be delivered 
as long as at least one packet from the sources is received by 
the sink, eight transmissions (e.g., without any retransmis- 
sions) can guarantee that the information is delivered with 
a probability no smaller than 96%. 

The above example clearly illustrates the necessity of tak- 
ing information into consideration. Yet, several difficulties 
need to address even in the very simple example. First, 
how can we capture and model the correlation at the in- 



formation sources? In addition, the correlation may change 
over time. How can we capture the dynamic changes in 
a timely fashion? Second, the above example only consid- 
ers the correlation at the information sources, how can we 
perform information processing at intermediate relay nodes 
for better QoS provisioning and resource saving? Third, the 
use of application-layer information in network protocols has 
changed the fundamental design principle of current Inter- 
net architecture, where the network is considered as a packet 
transportation tool and the service guarantee is promised for 
individual packets. This fundamental change renders tradi- 
tional performance modeling and evaluation approaches in- 
valid for information-driven networks. For instance, network 
throughput in terms of number of bits per time unit and end- 
to-end packet delay are no longer good measures and new 
metrics should be used to align with the need of information- 
driven networks. What should be a good model for perfor- 
mance evaluation and resource scheduling for information- 
driven networks? 

During the last several years, there are substantial re- 
search efforts devoted to tackling the first two difficulties. 
Particularly, the spatial and temporal correlations of infor- 
mation have been studied and utilized in network scheduling 
and resource saving in wireless sensor networks [181 [2DJ [21] ; 
information redundancy has been exploited to help load bal- 
ancing and improve fault tolerance in peer-to-peer content 
sharing systems [21 [T2]. Regarding the third challenge, to 
the best of our knowledge, the only attempts to accommo- 
date information processing in performance modeling are the 
work in 8, 21 . Nevertheless, information processing is sim- 
ply modeled with a scaling function in |8] [21] and the infor- 
mation embedded in data packets has not been modeled, let 
alone utilized. In this sense, the performance models in [8] 
I21j are not really information-driven models. A systematic 
performance modeling framework suitable for information- 
driven networks still remains largely open. 

In this paper, we propose the first-of-the-kind analyti- 
cal framework suitable for performance study and resource 
scheduling of information-driven networks. Particularly, we 
make the following contributions: 

1. We present a comprehensive performance model for 
information-driven networks based on the recent de- 
velopment of stochastic network calculus. Particularly, 
our model captures the information correlation and the 
QoS guarantee with respect to stochastic information 
delivery rates, which have never been formally mod- 
eled before. 

2. We study the stochastically achievable network capac- 
ity in information delivery and derive the bounds on 
probability of information delivery within a given end- 
to-end information delay. 

3. We study the transmission scheduling problem in in- 
formation - driven networks and design algorithms to 
search for feasible transmission schedules to meet given 
QoS requirements. 

4. We give examples to illustrate how our analytical frame- 
work can be used to solve problems in practice. 

The rest of the paper is organized as follows. Section [2] 
includes the background knowledge on stochastic network 
calculus. In Sections [3] and [4] we present a calculus and a 



system model for information-driven networks, respectively. 
We analyze the stochastically achievable information deliv- 
ery rates in Sections [S] and study the transmission schedul- 
ing problem in Section [6] We present the numerical exper- 
iment results in Section [T] In Section |8] we answer several 
common questions on information-driven network calculus, 
and in Section [9] we introduce related work. The paper is 
concluded in Section [Tol 

2. BACKGROUND OF STOCHASTIC NET- 
WORK CALCULUS 

2.1 Notation 

We first introduce the notation and key concepts of stochas- 
tic network calculus [111 117] . Throughout this paper, we 
assume that all arrival curves and service curves are non- 
negative and wide-sense increasing functions. Following the 
convention, we use A{t) and A*{t) to denote the cumulative 
traffic arrives and departures in time interval {0,t], respec- 
tively, and use S{t) to denote the cumulative amount of 
service provided by the system in time interval (0,i]. For 
any < s < i, we denote A{s,t) = A{t) - A{s),A*{s,t) = 
A*{t) - A*{s), and S{s,t) = S{t) - S{s). We adopt A{0) = 
^*(0) = 5(0) = 0. 

We denote by J- the set of non-negative wide-sense in- 
creasing functions, i.e., 

:F={fi)--yO<x<y,0<f{x)<f{y)}, 

and by the set of non-negative wide-sense decreasing func- 
tions, i.e., 

^={f{-)--yO<x<y,0<f{y)<f{x)}. 

For any random variable X, its distribution function, de- 
noted by Fx = Prob{X < x}, belongs to and its comple- 
mentary distribution function, denoted by Fx = Prob{X > 
x}, belongs to T. 

2.2 Operators 

The following operations defined under the (min, +) alge- 
bra [4j[7l[l6] will be used in this paper: 

• The (min, +) convolution of functions / and g is 

{f®gm^ \ni{f{s)+g{t-s)}. (1) 

0<s<t 

• The (min, +) deconvoluUon of functions / and g is 

(/0 5)W =sup{/(t + s)-3(s)}. (2) 

• The (min, +) inf-sum of functions / and g is: 

(/0 5)(t) = inf{/(t + s) + <?(«)}. (3) 

2.3 Performance Measures, Traffic and Server 
Models 

The following measures are of interest in service guarantee 
analysis under network calculus: 

• The backlog B{t) in the system at time t is defined as: 

B{t) = A{t)-A*{t). (4) 



• The delay 'D{t) at time t is defined as: 

r>(t) = inf{r > : A(t) < + r)}. (5) 

Stochastic arrival curve and stochastic service curve are 
core concepts in stochastic network calculus with the for- 
mer for traffic modeling and the latter for server modeling. 
It is worth noting that the deterministic arrival curve traf- 
fic model and the deterministic service curve server model 
under (deterministic) network calculus are a special case of 
their corresponding stochastic definition. In the literature, 
there are several definition variations of stochastic arrival 
curve and stochastic service curve |11) such as: 

Definition 1. A flow A{t) is said to have a maximum- 
virtual-hacklog- centric stochastic arrival curve a ^ T with 
bounding function f £ ^, denoted by A ^m.b.c.< f,ct>, iff 
for all t > and all x > 0, there holds 

Prob{ sup sup [A{u,s) — a(s — u)] > a;} < f{x). (6) 

0<s<t 0<u<a 

Definition 2. A server is said to provide a flow A{t) 
with a stochastic service curve /3 £ J- with bounding function 
g £ denoted by S ^s.c< g,P >, iff for all t > and all 
x>0, there holds f77| / 

Prob{ sup [A 8 /3(s) - A*{s)] > x} < g{x). (7) 

0<s<t 

With the above definitions and their variations, various 
properties of stochastic network calculus, including stochas- 
tic backlog and stochastic delay bounds, have been proved 
(e.g., see [ill ED- 

3. AN INFORMATION-DRIVEN NETWORK 
CALCULUS 

3.1 Notation on Information 

Traditional stochastic network calculus uses the traffic ar- 
rival curve and the service curve to count for cumulative 
amount of traffic or service. If network nodes are informa- 
tion driven and can perform in-network processing, we need 
a translation from the amount of traffic to the amount of 
information in order to model information processing at net- 
work nodes. We hence develop a stochastic network calcu- 
lus dedicated to performance modeling of information-driven 
networks. 

Although entropy (or entropy rate) is broadly used to 
measure information of random variable (or random pro- 
cess) [6], to avoid various details in entropy estimation in 
specific applications [TS], we simply use the notation H{A{t)) 
to denote the information of A{t) but leave its practical 
meaning and calculation open to users. Nevertheless, we 
need to define basic properties of information to make fur- 
ther analysis possible. 

Definition 3. The information of a flow A{t) is denoted 
as H[A{t)) and has the following properties: 

1. H(0) = where denotes the null set; H{A{0)) = 0. 

2. H(A(t)) is a non-negative, non- decreasing function of 
timet, and for yt andO < s <t, H{A{t)) = H{A{s)) + 
H{A{s,t)) holds for the same flow, i.e., the flow from 
the same information source. 



3. For different fl,ows Ai{t),i = 1, . . . , N , 

N N 

J2H{Mt))>H{J2Mt)), (8) 

1=1 i=l 

where X^^i^i(^) means the superposition of fl.ows 
Ai{t),..\AN{t). 

The first property of information means tliat tiie infor- 
mation of a flow is initially zero. The second one means 
that the information of a flow does not become smaller as 
more data arrive and the information from the same flow is 
accumulative, i.e., H{A{s,t)) should be understood as the 
new information in the flow at time interval {s,t\. The third 
property means that the superposition of different flows do 
not increase the total information in the flows. Note that 
these three properties are consistent with the entropy defi- 
nition. 

For ease of exposition, we omit the time index whenever 
doing so will not cause confusion. 

Definition 4. Redundant information is a measure of 
the information redundancy in different flows, ^i, . . . ,An- 
It is defined in this paper as 

N N 

J(Ai;...;Aiv) = ^/f(AO-//(^AO, (9) 

and has the following properties: 

1. If A\,...,An are independent, I{Ai; . . . ; Am) equals 
0. 

2. For any i{^l,...,N), I{Ai; An) < H{Ai). 

Definition 5. Information of a set of data sources A —< 
Ai, . . . , Am >, 11(A), is defined as 

M M 

H{A) = H{J2 A^) = J2 H{A,) - Am) (10) 

■i=l 2=1 

3.2 Modeling Flow and Service with Respect 
to Information 

Definition 6. A flow A{t) is said to have an information 
stochastic arrival curve a £ J- with bounding function f £ 
JT, denoted by A ~i.s.a.< /, Q >, iff for all t > and all 
X > 0, there holds 

Prob{ sup sup [II{A{u,s))-a(s-u)] > a;} < f{x). (11) 

0<s<tO<ti<s 

Definition 7. A flow A{t) is said to have an lower - 
bounded information stochastic arrival curve 7 G with 
bounding function Q a T , denoted by A ^i.i.s.a< 6, J >, iff 
for all t > and all x > 0, there holds 

Prob{ inf inf [H{A{u, s)) -'y{s -u)] < x} < e{x). (12) 

0<s<t 0<u<s 

Definition 8. A server is said to provide a flow A{t) 
with an information stochastic service curve fi £ T with 
bounding function g £ T, denoted by S ^i.s.s.< g,P >, iff 
for all t > and all x > 0, there holds 

Prob{ sup [H(A(s)) ® [/3]"(s) - H{A*{s))] > x} < g{x) 

0<s<f 

(13) 

where [P^it) = ma.x{(3{t),x}. 




Figure 2: The illustration of performance measures 

Note that information service includes both information 
processing and information transmission. We also use the 
terms information service rate and information arrival rate: 

Definition 9. The (average) information service rate of 
a server following ~i.s.s.< g,l3 > is defined as limj^oo ^-7-^. 

Definition 10. The (average) information arrival rate 
of a flow following ^i.a.a. < f,a > is defined as limt_,oo —7-^ . 

In the rest of the paper, unless otherwise mentioned, ser- 
vice by default is referred as information service. 

3.3 Performance Measures 

The following definitions are used for information guaran- 
tee analysis: 

Definition 11. The information delay of input flow A{t) 
in a system at time t is defined as: 

D{t) = inf{r > : H{A{t)) < HiA'it + r))}, (14) 

where A* (t) is the output flow. 

Definition 12. The information backlog at time t in a 
system is defined as: 

B{t) = H{A{t))~H{A'{t)), (15) 

where A(t) and A*[t) are input flow and output flow, respec- 
tively. 

Definition 13. The information backlog within delay bound 
r(< D{t)) at time t is defined as: 

B{t,T) ^ H(A(t))- H{A*{t + T)). (16) 

To help better understanding, Figure[2]illustrates the above 
three definitions. 

3.4 Basic Properties of Information-Driven Net- 
work Calculus 

To make the paper concise and easy to follow, we put all 
proofs in Appendix El 

Lemma 1. For any random variables X and Y , and 

Vs > 0, if Fx{x) < f{x) and Fy{x) < g(x), where f,g £ T, 
then 

Prob{X + Y >x}<{f®g){x). (17) 



Lemma 2. For any random variables X and Y , and \/x > 
0, if Fx{x) < f{x) and Fy{x) < g{x), where f £ T,g £ T, 
then 

Prob{X-Y>x}<{fQg){x). (18) 

Theorem 1. (Superposition) Consider two flows Ai{t) 
and A2{t). Let A{t) denote the aggregate flow, i.e., A{t) — 
Ai{t) + A2{t). If for both flows Ai ^i.s.a.< .fi,ai >,i = 
1,2, and /(Ai; A2) ^i.i.s.a.< f, 7 >. then A ^i.s.a.< f,ce >, 
where f{x) = (/i(g)/2 06')(a:), anda{t) = ax(t) + a2{t)-'y{t). 

Theorem [T] means that if two flows follow i.s.a. curves, their 
information fusion (i.e., flow aggregation with information 
redundancy removed) also follows an i.s.a. curve. 

Theorem 2. (Concatenation) Consider a flow A(t) pass- 
ing through a network of N nodes in tandem. If each node 
provides service S* ^i.s.a.< 5*, >, i = 1, 2, . . . , A*', then the 
network guarantees to the flow a service S ^i.s.s.< g,P > 
with 

I3{t)=l3'- ®...®l3'^{t) 

g(x) = (g) . . .(^g^{x). 

Theorem [2] indicates that the service provided by an end-to- 
end path follows an t.s.s. curve if the nodes along the path 
follow i.s.s. curves. 

Theorem 3. (Output) Consider a node with input flow 
A. If A ~i.s.a.< /, a >, and the node provides the flow 
with service S ~j.s.s.< g,l3 >, then the output A* '~i.s.a.< 
f(Sg,a0l3>. 

Theorem [3] means that the input flow and the output flow 
both follow I.s.a. curves, if the service node follows an i.s.s. 
curve. 

The following theorem illustrates the service guarantee 
provided by a network node in terms of information backlog, 
information delay, and information backlog within a delay 
bound. 

Theorem 4. (Service guarantee) If the input flow has 
A '~^i.s.a.< f,a >, and the network node provides the flow 
with service S ~i.s.s< g,(3 >, then 

1. The information backlog B(t) of the flow at time t sat- 
isfies: for all t > and all x > 0, 

Prob{B{t) >x}<f®g{x-a(Z) /3(0)). (19) 

2. The information delay D(t) of the flow at time t sat- 
isfies: for all t > and all x > 0, 

Prob{D{t) > /l(Q^ [13]^)} < / ® g{x), (20) 

where a'^(t) = a{t) + x, [l3]'^{t) = max{/3(t), a;}, and 
h(a, (3) is the maximum horizontal distance between 
functions a and f) and is defined as 

h{a,(3) = sup{inf{r > : a(s) < (3{s + r)}}. 

s>0 

3. The information backlog within delay bound r(< D{t)) 
of the flow at time t satisfies: for all t > and all 
x>0, 

Prob{B{t,T) > x} < f ®g{x+ inf [/?(«) - a{v - r)]). 

(21) 

Note that a{t) =0 fort <0. 
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Figure 3: The architecture of information-driven 
networks 

4. SYSTEM MODEL OF INFORMATION- 
DRIVEN NETWORKS 

The architecture of information-driven networks is illus- 
trated in Figure O including information sources, the trans- 
portation paths, and the information destination (or the 
sink). To simplify presentation, we only consider one sink 
node. The model, however, could be easily extended to mul- 
tiple information sinks. 

4.1 The Model for Information Sources 

Assume that there are M information sources in the sys- 
tem, denoted as Ai, A2, . . . , Am , respectively. We assume 
that an information source Ai{i = 1, . . . , M) has a certain 
amount of information to send to the destination. We de- 
note the traffic sent from Ai as Ai{t) and the traffic finally 
arrives at the destination as ^^'(t), where hi is the number 
of hops of the path used for the information delivery from 
Ai to the destination. 

For the M information sources, we use I{Ai{t); . . . ,A'j{t)), 
1 < i < j < M to model the information redundancy among 
different data sources. 

4.2 The Model for Transportation Paths 

We assume that K paths, denoted as Li, . . . , Lk, from the 
set of information sources to the destination are used for in- 
formation delivery. In this paper, we only focus on K paths 
that are node-disjointed. When multiple paths share some 
common intermediate nodes, the number of possible schemes 
for information fusion becomes tremendous, in which case 
it is extremely difficult to derive the performance bounds. 
We leave the study of multiple paths sharing intermediate 
nodes as our future work. Nevertheless, we do not assume 
the data transmissions along the K paths are always inde- 
pendent. For example, in wireless networks even if two paths 
do not share common intermediate nodes, the transmissions 
along the two paths may interfere with each other. In short, 
we assume that the transmissions along different paths may 
impact each other, but information fusion is only performed 
at the entrance nodes and the sink node. As a reminder, the 
information service in Definition|8]includes both information 
processing and information transmission. 

In addition, we assume that an end-to-end path, once es- 
tablished, is fixed for the time period of interest. Networks 
in which an end-to-end path changes quickly over time are 
beyond the focus of this paper and are left as our future 
work. 

We denote a path Li of length hi (i.e., the hop count from 
the source to the destination is hi) as Li = (L°, L\, . . . , L^'), 



where denotes the j-th node of path i. Each node Lj 
provides an information stochastic service curve S ~i.s.s< 
gf,l3f > along the path. 

When two paths, Li = {L1, L\, . . . , L^'^) and Lj = {L^j,L^j, 
. . . , L^^ ) interfere with each other in data transmission, we 

define an hi x hj matrix [J^i^rn)],l ~ 0, . . . ,hi ~ l,m = 
1 to capture the transmission correlation along 



Ai(t) 



0, 



the two paths, where 1(i,m) is an information impairment 
process that should be integrated in the information service 
curves of the nodes L\ and L™. This information impair- 
ment process is introduced to reduce the information service 
rate along the paths involved. Note that L^' = l!^-' since 
we assume the same destination. 

To investigate the impact of transmission interference, the 
following theorem will be used to adjust the service rate of 
an impacted node. Its proof is provided in Appendix 1^ 

Theorem 5. (Service reduction with information im- 
pairment) Consider a network node providing a flow with 
service S ^i.s.s.< <?,/3 >• If the node is interfered with 
an impairment process I with information stochastic arrival 
curve T ~i.s.a.< a >, then the network node guarantees 
to the flow a service S ~i.s.s.< g ® f , (5 — a >. 

4.3 Problems of Interest 

Thus far, we have presented a calculus and an abstract 
model for information-driven networks. Among many inter- 
esting problems in this analytical framework, we are inter- 
ested in the following key questions: 

1. What are the stochastically achievable information de- 
livery rates of a network? The answer to the question 
discloses the limit of information delivery in the sys- 
tem. 

2. Given a set of information sources and an end-to-end 
delay bound, can the total information be delivered 
to the destination with a high probability, and how? 
The answer to the question provides insights on the 
design of scheduling algorithm for information-driven 
networks. 

In this paper, we focus only on the feasibility of infor- 
mation delivery and performance bounds, but intentionally 
avoid the study of many interesting optimization problems 
within the analytical framework. This is because the main 
purpose of this paper is to present a new analytical approach 
and its application for information-driven networks and also 
because traditional measurements on resource overhead have 
not been well defined in information-driven networks. For 
instance, we need to mathematically formulate the compu- 
tational overhead for information processing. We leave these 
aspects as our future work. 
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Figure 4: An information splitter for servers in par- 
allel 



Definition 14. A weighted information splitter is a sched- 
uler that splits an input flow A{t) into multiple informa- 
tion exclusive sub-flows Ai{t), . . . , ^iv(t), with each assigned 
a weight Wi and served by an information processing node 
S^(i = 1,...,N), respectively. At any time instant t, the 



sub-flow assigned to satisfies H{Ai{t)) 



■H{A{t)). 



Theorem 6. (Parallel servers) Consider a flow A{t) 
passing through a weighted information splitter and then a 
network of N nodes in parallel (Figure^. Assume that all 
nodes are work conserving, i.e., they cannot become idle if 
there is information waiting for service. Assume that each 
node provides service 5" ~i.s.s.< (?*,/3' > (i = 1,2, ...,A'^) 
and the weight of the sub- flow to at time t is set to /3^{t). 
The whole system guarantees to the flow a service S ~i.s.s.< 
g,P > with 



m = 

g{x) =g^ d 



It is worth highlighting that Theorem[S]is intended to dis- 
close a network's achievable information delivery rate, and 
for this reason it assumes an ideal situation in which infor- 
mation could be split in infinitesimal amounts. In practice, 
however, such an assumption may not hold and the net- 
work's actual information delivery rate would be lower. 

Equipped with the theorems in this paper. Algorithm Rate- 
Cal, shown in Figure O can be used to search for stochas- 
tically achievable information delivery rates of a network. 
It uses a brutal-force search and has a time complexity of 
0(2^), where K is the number of node-disjoint paths from 
the sources to the destination. The algorithm is not scal- 
able with K, but practically K is usually not very large. In 
addition, it is very easy to reduce the complexity by con- 
sidering the stochastic relationship of i.s.s. curves. A curve 
^i.s.s.K g^ , > is considered better than another curve 
^r.s.s.< g'^,13^ > lif for yt,x > 0, I3^{t) > I3^{t) and 
g^{x) < g^{x). In this case, we can ignore the curve to 
reduce the search space. 



5. STOCHASTICALLY ACHIEVABLE INFOR- 
MATION DELIVERY 

We answer the first question in this section. For this, we 
need to investigate the information service guarantee pro- 
vided by multiple servers in parallel, as shown in Figure 2] 
For simplicity, we assume a fluid model in which information 
could be split in infinitesimal amounts. This constraint will 
be relaxed in the next section. 



Example 2. We use a simple example to illustrate the 
calculation of stochastically achievable information delivery 
rates of a network. Assume that a network has three par- 
allel end-to-end paths, Li,i = 1,2,3. Assume that path Li 
includes i{— 1,2,3) nodes, respectively. Assume that the 
transmissions along different paths do not interfere with each 
other. Also assume that all nodes provide the same infor- 
mation stochastic service following '~^i.s.s.< e~^,rt >. Run- 
ning Algorithm RateCal, we obtain seven i.s.s. curves, as 



Input: A network with K parallel paths (Figure |4]); 

i.s.s. curve of each node; 

A set of impairment matrices; 
Output: a list of stochastically achievable services; 
Method: 

1: FOR any subset of the K paths, {Li, . . . ,Lj}, 

where ^<i<j<K{ 
2: FOR n = i TO j{ 

3: FOR each node along path L„ , if its transmission 
is interfered by another path in the subset, 

4: Adjust the node's i.s.s. with Theorem [5] and 

the i.s.a. curves in relevant impairment matrices; 

5: Calculate the i.s.s. of path Ln with Theorem[2] 
} 

6: Output the i.s.s. of subset {Li, . . . ,Lj} with Theorem [S] 
} 



Figure 5: RateCal: An algorithm to calculate 
stochastically achievable information delivery rates. 




Figure 6: Stochastically achievable service curves in 
the example network 



shown in Figure\^ For instance, based on the i.s.s. curve 
we can see that for a (deterministic) infor- 
mation arrival following ^i.s.a.< 0, 3rt >, the network can 
guarantee its delivery such that up to time t, the probability 
that the output information is less than 3rt — 24 is smaller 

-24 

than 0.1(Ri 6e~B-). 

6. TRANSMISSION SCHEDULING 

Although the previous section provides the answer to stochas- 
tically achievable information delivery rates, the results are 
based on an ideal assumption, that is, the information could 
be split in infinitesimal amounts. In practice, however, we 
are often met with the situation that information cannot be 
split arbitrarily. In this sense, the results of previous section 
only provide theoretical upper bounds on stochastic infor- 
mation delivery rates, which may not be practically feasible 
for a given request. We hence need to study the following 
problem: Given a set of information sources and an end-to- 
end delay bound, can the total information be delivered to 
the destination with a high probability (e.g. no lesser than 
p)? If yes, how? if not, what is the percentage of the infor- 
mation that could be delivered and in what probability? 



Definition 15. A transmission schedule for a set of in- 
formation sources A =< Ai, . . . , Am > over a set of paths 
L =< Li, . . . , Lk > is denoted as a tuple < A, L,n > , where 
TT is a mapping function from A to L, andTT{Ai) = Lj means 
that data from source Ai{l < i < M) ts sent over path 
Lj{l < J < K) m the transmission schedule. 

Definition 16. Information delivery ratio within delay 
t: For a set of information sources A = (Ai, . . . ,Am) sent 
from M sources to a destination, the information delivery 
ratio at time t, TZ{t), is defined as 



n{t) 



H{A*it + r)) H{A(t))^B{t,T) 



H{A{t)) 



H{A{t)) 



(22) 



where A* (t + r) represents the set of traffic that has arrived 
at the information sink at time t + t. 

The scrutinizing readers may have realized that H{A*{t-\- 
r)) is not well defined because to get it needs calculation 
of I{Al; . . . ; A\i), which has not been defined. The follow- 
ing theorem discloses the relationship of information redun- 
dancy before and after information delivery and can be used 
to overcome this difficulty. 

Theorem 7. Consider a set of information sources A — 
{Ai, . . . , Am). Split the set into two subsets Ai — {Ai, . . . ,Ai 
and A2 — (^i+i, . . . , Am), and transmit Ai and A2 simul- 
taneously along two separate paths at time t. Assume that at 
time t + T, both H{Ai) and H{A2) have been received, i.e., 
H{AUt -f r)) = H{Ai{t)) and H{Al{t -h r)) = H{A2{t)). 
We have I{Al{t + r); A^t + r)) = /(Ai(i); A2(t)). 

Note that Theorem [7] holds no matter whether the infor- 
mation redundancy within each set Ai,i = 1,2 is removed 
or not during the transmission. 

With Definitions [15] and 1161 the formal formulation of 
the problem becomes: Given a set of information sources 
A — {Ai, . . . ,Am), a set of separate transportation paths 
Li,...,Lk, an end-to-end delay bound D, and a proba- 
bility p, is there a transmission schedule to deliver the in- 
formation from the sources to the destination within time 
delay D with a probability no smaller than p? It is easy 
to see that this problem is NP-hard even without consid- 
ering the information redundancy among the information 
sources, because the problem can be transformed into a "bin- 
packing" problem |9 , which is NP-hard. We hence propose 
a method, called best-fit-largest-redundancy(BFLR) algo- 
rithm, to search for an approximate solution. An error may 
occur if the algorithm returns a negative answer to the ex- 
istence of a feasible transmission schedule, but a feasible 
transmission schedule does actually exist. Nevertheless, due 
to the NP-hardness of the problem, such an error cannot 
be avoided unless both K and M are very small so that we 
can use brutal-force search. Our later experimental study 
shows that BFLR can find almost all feasible transmission 
schedules. 

The basic idea of the BFLR algorithm is to use the stochas- 
tically achievable service rates obtained by Algorithm Rate- 
Cal as the guideline in the selection of a feasible subset of 
paths, and combine information sources that have the largest 
information redundancy and can best fit to a transportation 
path. The detailed steps of BFLR is illustrated in Figure [T] 

When the BFLR algorithm returns no, it may be because 
either (1) the information service rate of a selected path is 



Input: A network with K parallel paths (Figure |4]), each node's service curve, and impairment matrices; 

A set of information sources A = {Ai, . . . ,Am) and related information redundancy; 

End-to-end delay bound D; 

The violation probability p; 
Output: Feasible transmission schedules, or no feasible transmission schedule; 
Method: 

1: Call Algorithm RateCal to obtain all stochastically achievable service rates, each characterized by ^i.s.s.< g^,l3^ >; 

2: Remove the achievable service rates if its rate is smaller than the information arrival rate of H{A); 

3: Sort the remaining achievable service rates in the decreasing order of rate; 

4: Sort Ai in the decreasing order of their information arrival rates; //Note: Ai ~i.s.a.< f^,a^ >■ 

5: FOR each remaining achievable service rate in the sorted order { 

6: Find the subset of paths corresponding to the current achievable service rate; 

/ / Note: A feasible service rate represents a subset of paths. 
7: Sort the paths in the decreasing order of their service rates; 

8: AS = 0; // Note: AS is an initially empty set to store information sources that could be combined together. 
9: FOR each path in the subset { 

10: Find the information source whose information arrival rate best fits the current path; 

/ / Note: Best-fit means that the source's information arrival rate is smaller than the service rate of the path, 
but is the largest among all non-processed information sources. 
11: Use Theorem|4l(2) to check if the information source can be delivered with the current path to meet the requirement; 
12: Go to the next path if best-fit cannot be found; 

13: Label the best-fit information source as processed and insert it into ^45; 
REPEAT { 

14: Find the non-processed information source that has the largest redundancy with ^45'; 

15: } UNTIL AS cannot be delivered with the current path to meet the delay requirement; 
16: IF all information sources have been processed, output the transmission schedule; 
ELSE Go to the next path. 

} 

17: Go to the next achievable service rate; 

18: Return NO if all achievable service rates have been checked but no feasible transmission schedule has been found. 
19: } 



Figure 7: BFLR: An algorithm to search for feasible transmission schedules. 



smaller than the combined rate of a subset of sources, or (2) 
the information service rate of a selected path is larger than 
the combined rate of a subset of sources but the information 
cannot be delivery in the (tight) delay bound. In the first 
case, no feasible transmission schedule exists even if we en- 
large the delay bound. In the latter case, the problem that 
we need to answer is: what is the information delivery ratio 
TZ within the delay bound D and in what probability? The 
problem can be answered by revising the BFLR algorithm 
and using Theorem 3] (3). Basically, we should remove Line 
18 of the algorithm, and in Line 11 use Theorem |4](3) to 
check the information backlog within the delay bound and 
the corresponding stochastic bounding function. In addi- 
tion, the output (Line 16) should be changed accordingly. 
We omit the details to avoid triviality. 

7. CASE STUDY 

7.1 Network Configuration 

We use an exemplary information-driven network to illus- 
trate how the results in this paper can be used for perfor- 
mance evaluation in practice. 

Information sources. We consider a network consisting 
of M = 9 information sources. These sources are partitioned 
into 3 groups with sources 1 3 in Group 1, sources 4^6 
in Group 2, and sources 7 ~ 9 in Group 3. Information from 
the same group exhibits spatial correlation, but information 



from different groups is assumed to be independent. 

Temporal correlation. Assume that the information of 
each source is collected by periodically sampling a stationary 
Gaussian stochastic process. Concretizing the information 
H{-) of the flow Ai generated by source i to the Shannon 
entropy function Q it yields for discrete time t [6] : 

HiA(t)) = m(t) = ^log{2neY\Cl'^\, t = l,2,... (23) 

where C|*' is the txt covariance matrix for the flow of source 
i and is specified by the temporal covariance function ri(r), 
i.e., the matrix element c\'^\j,k) = Vi{k — j), where 1 < 
j, k < t. Here, we adopt the typical exponential covariance 
function [24] : 

r,(r) =a?e-l^^/"', r = 0, ±1, ±2, ... (24) 

where erf is the variance of flow At and r]i is a constant. 
Assume that each source generates messages at a constant 
interval 5. By applying Equation (|24|l in (|23|l , the stochastic 
arrival curve of flow Ai can be specified in the continuous 
time t as Ai ~i.s.a.< 0, Qi >, where 



^Shannon's entropy funct ion i s consistent with our definition 
of information in Section 13.11 



a^{t) = (25) 
^log(27rea?),0<t <5 

^ log(2^ea?(l - e-2/"-)) - i log(l - e-^/^^),t> 5 

Spatial correlation. In the same source group, we model 
the information redundancy of the sources using a spatial 
correlation model similar to that in [25 . Specifically, we 
assume that for sources in the same group, 

i i 

where are constants and depend on the sources' loca- 
tions and the adopted spatial model |24) . In our study, we 
use the same setting of information redundancy for all three 
groups. Specifically, for the three sources in a group, de- 
noted as A\, A-i, Az, we set: 

+ Aa) = H{A^ + Az) = H{Ax -h A3) = \.m{A^) 
H{A^^ A2^ Az] = 2AH{Ax) 

While the above parameter settings are kind of arbitrary, we 
stress that using other parameters has no impact on the ef- 
fectiveness of our analytical framework since users can adopt 
any other temporal and spatial correlation models different 
from our example scenario in their analysis. 




Figure 8: An example network 

Transportation paths. As shown in Fig. [S] the col- 
lected information is transmitted to a single information 
sink through four parallel end-to-end paths, Li,i — 1,2, 3, 4. 
On each path Li, there are i nodes. Assume that all net- 
work nodes provide information service following ~i.s.s.< 
e~^,r{t — d) >, where r is the average information service 
rate and d is the per-hop delay. Also assume that the first 
node in Li and the first node in L2 are subject to correlated 
impairment, which follows ~i.s.£i.< 4e~^''*,r(t — d)/5 >. 
Similarly, assume that the second node in L3 and the sec- 
ond node in L4 are subject to correlated impairment, which 
follows ~i.s.a.< 3e-^/^r■(^ - d)/3 >. 

Note that Example [1] is a simplified special case of this 
case study. 

7.2 Numerical Results 

We would like to know whether the information network 
could deliver the information from sources to the destination 
within a given delay bound with at least a certain probabil- 
ity. If feasible transmission schedules cannot be found, we 
also want to know the information delivery ratio with at 
least a certain probability. Since the performance measures, 
e.g., delay and information delivery ratio, are time depen- 
dent, we calculate their maximum value to remove the time 



Table 1: Information stochastic service curve of each 
path 



Path 


w/o impairment 


w/ impairment 


Li 


< e''^,r{t - d) > 


< 5e-"''Mrt - |rd) > 


L2 


< 2e-^''^r(^-2d) > 


< 6e-"''Mrt- |rd) > 


Lz 


< 3e-"■'^r(^-3d) > 


< Se-'^Mrt- |rd) > 


Li 


< Ae"'/^,r{t-'id) > 


< ee-'/Mri - ^rd) > 



dependancy in this case study. For example, the delay is 
calculated as supf^Q D{t). 

In the example study, we select the rate parameter r = 
8 kbps, per-hop delay d = 7.5 ms. For every information 
source, we set S = 100 ms and rji — 100. With reference to 
Equation (|25|l , ai is set such that the long-term information 
rate of a source « 2.33 kbps. Considering the spatial corre- 
lation modeled, the total long-term information arrival rate 
of the 9 sources amounts to 16.78 kbps. 

First, we calculate the information stochastic service curve 
of each path and list the results in Table [1] 

To facilitate understanding, we list the subsets of paths 
found by Algorithm BFLR that have total information ser- 
vice rate larger than the total information arrival rate, i.e., 
the intermediate result after Step 2 of Algorithm BFLR: 



1. 


Li 


+ L2 


+ L3 


'"^i.s.c 


< 146-*/", 


i^rt 


-frd> 


2. 


Li 


+ L2 


+ L4, 


'"^i.s.c 


< 15e-'''^^ 


'4rt 


-■frd> 


3. 


Li 


+ L3 


+ L4, 


'"^i.s.c 


< 12e-'/i2^ 


Irt- 


-frd> 


4. 


L2 


+ L3 


+ L4, 


'"^i.s.c 


< 13e-'/l^ 


Irt- 


- f rd> 


5. 


Li 


+ L2 


+ L3 


+ Li 


"^i.s.c. ^ 22c 


-t/22 

1 


44 , 134 
is"- 15 



The first item means that if path 1, path 2, and path 3 are 
used, the achievable stochastic information service follows 
^i.s.s.< 14e"'/^*, ^rt - ^rd >. The explanation of other 
items is similar. 

We can search for feasible transmission schedules using 
Algorithm BFLR for different delay bounds (e.g. 35 ms and 
45 ms) and for different violation probabilities (e.g., Prob = 
0.001 and Prob = 0.0001). The results are shown in TableU 
In the table, "X" denotes that the correspondent path set is 
not feasible. "Ai.j" stands for the source j in Group i. As an 
example, the first three lines on the third column mean that 
if we send information from sources Al.l, A1.2, A1.3 along 
path Li, information from sources A2.1, A2.2, A2.3 along 
path 1/2, and information from sources ASA, A3.2, A3.3 along 
path L3, the total information can be received within delay 
bound 35 ms with a violation probability no larger than 
0.001. Because information from the same source group ex- 
hibits the largest redundancy, the results demonstrate that 
Algorithm BFLR is capable of finding a transmission sched- 
ule so that information from the same source group is fused 
together to remove information redundancy. 

As discussed before, when the BFLR algorithm returns no, 
it may be because either (1) the information service rate of a 
selected path is smaller than the combined rate of a subset of 
sources, or (2) the information service rate of a selected path 
is larger than the combined rate of a subset of sources but 
the information cannot be delivery within the delay bound. 
We find that the two subsets of paths, < Li,Lz,L4 > and 



Table 2: Results of BFLR: feasible transmission 
schedules 



Prob 1 Delay bound 


35 lus 


45 ms 


.1% 


(I/l, 1/2, 1^3) 


Ll : ^1.1 ~ 1.3 


Ll : 41.1 ~ 1.3 


L2 : A2.1 ~ 2.3 


L2 : A2A ~ 2.3 


L3 : A3.1 ~ 3.3 


L3 : A3A - 3.3 


(I/l, 1/2, 1^4) 


Ll : ^1.1 ~ 1.3 


Ll : AlA ~ 1.3 


L2 : A2.1 ~ 2.3 


L2 : A2A ~ 2.3 


L4 : A,i.l ~ 3.3 


L4 : ASA ~ 3.3 


(I/l, 1/3, 1^4) 


X 


X 


(1/2, 1/3, L4) 


X 


X 


(Ll, L2, 1/3, L4) 


X 


Ll 


yll.l ~ 1.3 


L2 


yl2.1 ~ 2.3 


L3 


A3A ~ 3.2 


L4 


A3.3 


.01% 


(I/l, 1/2, 1^3) 


Ll : ^1.1 ~ 1.3 


Ll 


AlA ~ 1.3 


L2 : ^2.1 ~ 2.3 


L2 


A2A ~ 2.3 


L3 : ^3.1 ~ 3.3 


L3 


ASA - 3.3 


(I/l, 1/2, L4) 


X 


Ll 


yll.l ~ 1.3 


L2 


A2A ~ 2.3 


L4 


43.1 ~ 3.3 


(I/l, I/3, 1^4) 


X 


X 


(1/2, 1/3, 1^4) 


X 


X 


(Ll, 1/2, 1/3, L4) 


X 


Ll 


AlA ~ 1.3 


L2 


A2A ~ 2.3 


L3 


ASA ~ 3.2 


L4 


AS.S 



< L2, L3, L4 >, belong to the former case, meaning that no 
feasible transmission schedule exists even if we enlarge the 
delay bound. The other "X"s in Table [2] belong to the latter 
case, where we want to know the information delivery ratio 
within the delay bound. 

By setting even tighter delay bounds, 15 ms and 20 ms 
for example, we can make the rest three subsets of paths 
in Table [2] belong to the latter case. The results regard- 
ing information delivery ratio are shown in Table O Since 
the results are calculated using Theorem U (3) and Equa- 
tion (I22[) . the results are explained in the following way: the 
probability that the information delivery ratio within a delay 
bound is smaller than a value is not larger than a violation 
probability. For instance, the first line of the third column 
indicates that using paths Li,L2, and L3, the probability 
that the information delivery ratio within the delay bound 
of 15 ms is smaller than 56.4% is not larger than 0.1. The 
"twist" logic implies that using paths Li, L2, and L3, with a 
probability no less than 0.9, the information delivery ratio 
within the delay bound of 15 ms is at least 56.4%. 



Table 3: Information delivery ratio under small de- 
lay constraints 



Prob /Delay bound 


15 ms 


20 ms 


0.1 


(Li,L2,L3) 


56.4% 


63.8% 


(Ll, L2, L4) 


50.6% 


56.1% 


(Ll, L2, L3, L4) 


50.2% 


57.8% 


0.15 


(Li,L2,L3) 


59.7% 


67.1% 


(Ll, L2, L4) 


53.9% 


59.5% 


(Ll, L2, L3, L4) 


54.2% 


61.8% 



8. FURTHER DISCUSSION 

In this section, we briefly answer three common questions 
readers may have on the information-driven network calcu- 
lus. First, why do we use our own definition on information 
and redundant information instead of using entropy and mu- 
tual information [51 [21] directly? Second, why do we use 
stochastic service curves and stochastic arrival curves in- 
stead of the deterministic ones [T^? Third, there are other 
ways to define stochastic curves in the literature [111 I17| . 
why do we use the particular ones as in this paper? 

Regarding the first question, we emphasize that our defi- 
nition does not conflict with Shannon's entropy and entropy 
rate [21 [22]. In a calculus for performance modeling, all we 
care is the properties of the information in a flow instead 
of its exact calculation. Due to this reason, we intention- 
ally use generic notations, H and /, to denote information 
and redundant information, which meet certain (intuitive) 
criteria, but leave their practical meaning and calculation 
open to users. This is to avoid different details in entropy 
estimation in particular applications |14l 115) . 

Regarding the second question, it is commonly known that 
deterministic network calculus 16 focuses on the worst case 
analysis and as such the performance bounds obtained with 
deterministic network calculus is usually very loose. In ad- 
dition, in many applications, there is uncertainty in both 
information processing and information transmission due to 
limited computational capacity (e.g., sensor nodes) and un- 
reliable transmission links (e.g., wireless networks). For the 
information arrival process, although the entropy of a sta- 
tionary process grows linearly with t at a rate called entropy 
rate [6] and thus its information arrival curve could be mod- 
eled with a deterministic arrival curve, practically we usu- 
ally do not know the entropy rate in advance and have to 
resort to sample entropy [141 [15] , which exhibits stochastic 
features. 

Lastly, it has been observed that there are other forms 
of definitions on stochastic service curves and stochastic ar- 
rival curves [111 117) . for instance, the definitions with the 
sup removed or with single sups [11]. There are discussions 
in [111 [T7] on the difficulties of deriving the basic properties 
of the calculus if other forms of definitions are used. Never- 
theless, this does not necessarily mean that other definitions 
are improper. Actually we can transform one type of defini- 
tion to another, but the constraint on the bounding function 
usually needs to change and the corresponding results have 
more complex form. Definitions [1] and [2] have intentionally 
been chosen to ease the exposition. 



9. RELATED WORK 

To the best of our knowledge, there are currently no an- 
alytical tools available for systematic performance study of 
information driven-networks. The framework proposed in 
this paper is related to network calculus, particularly its 
stochastic branch: stochastic network calculus. Since its in- 
troduction in early 1990s 7 , network calculus has attracted 
a lot of research attention and evolved along two tracks - de- 
terministic and stochastic. Excellent books summarizing re- 
sults for deterministic network calculus are available (e.g. [H 
I16[). For stochastic network calculus, its research can also 
be tracked back to early 1990s (e.g. [13] [25]). However, due 
to some difficulties specific to stochastic networks 1111 117| , it 
is only in recent years when critical network calculus proper- 



ties such as concatenation property [5] [TT] and independent 
case analysis [11] have been proved for stochastic network 
calculus. 

The relevance of the present paper to stochastic network 
calculus lies in the analogy between the various models and 
properties defined or derived in this paper for information- 
driven networks, and the corresponding models and proper- 
ties under stochastic network calculus. However, we want 
to stress that there is significant difference between the in- 
formation network calculus proposed in the paper and the 
current development of stochastic network calculus, in that 
their targeted networks are different. The former is for 
information-driven networks where the key concern is about 
the quality of information delivery; the latter and network 
calculus in general are for networks where traffic is the focus. 
Due to this fundamental difference, special care has to be 
taken in developing the calculus for information-driven net- 
works. For example, information dependence and informa- 
tion redundancy are unique concepts for information-driven 
networks. While two traffic flows may be independent, they 
can carry the same or highly correlated information as dis- 
cussed in Example [T] 

In information-driven networks, in-network information 
processing is likely performed. From this in-network pro- 
cessing viewpoint, the present paper is related to [8] and [21| . 
In [8], scaling functions are used to model the relationship 
between the input traffic and the output traffic of a network 
element that processes the traffic. Based on the proposed 
scaling server model, [S] extends the deterministic network 
calculus by considering data scaling in networks with in- 
network data processing. In [2lj, how the scaling elements 
can be shifted across multiplexers in sensor networks is stud- 
ied, which enables worst-case analysis of traffic delay and 
backlog in such networks. Note that in [HlHI], information 
processing only applies to intra-flow data, leaving inter-flow 
processing un-considered. However, in our work, both intra- 
flow processing and inter-flow processing are considered. In 
addition, the essential focus of [8] and [21] is on traffic, while 
our focus is on information carried by traffic. 

In [T], the problem of network information flow is intro- 
duced. The focus of [T] is on a special type of in-network 
processing which is called network coding. With network 
coding, information is diffused through the network from 
the sources to the destination (s) and sources of flows may 
be jointly coded to achieve optimality in addressing the net- 
work information flow problem. An excellent introduction 
to network coding theory is available [261. The present 
paper is related to Tj and network coding literature in that 
they all take information as the central point of study. 

Note that with network coding, sources may be coded 
jointly. In such cases, focusing on traffic in the analysis is 
no more applicable, since the output flow is not a simple 
scaling of the input flow or the aggregate of input flows. 
For example, suppose flows /i and /2 are two bit streams. 
Applying exclusive-OR to the corresponding bits in them 
results in a new flow /i © /2. In the current network cal- 
culus literature including [8] and [21], traffic amount and 
(traffic) service amount are the concern. In the example, for 
traffic, ^/ie/2(^) = — ^/2(*)- service to each 

individual flow or superposition of these two flows, however, 
the current network calculus approach provides no answer, 
since no corresponding output of /i or /2 is found out of 
the exclusive-OR operation. With the proposed calculus in 



this paper, the exclusive-OR operation may be modeled as 
an information server providing a deterministic information 
service curve f3{t) = oo to either flow fi, i = 1,2. Then, 
based on results in this paper, particularly Theorem [3] we 
can say that the output flow /i © /2 preserves the informa- 
tion of either flow fi, i = 1,2. 

Finally, we would like to highlight that network coding 
and other in-network processing techniques signiflcantly com- 
plicate network performance analysis both in terms of traffic 
service guarantees and in terms of information service guar- 
antees. While a lot of network calculus results are available 
for potential use in analyzing traffic service guarantees in 
such networks (e.g., [IS]), no previous work has been found 
for analyzing information service guarantees in these net- 
works. We believe the proposed calculus makes a critical 
step and sheds light on further development to address this 
challenge. 

10. CONCLUSIONS AND FUTURE WORK 

When network nodes become information aware and are 
capable of information processing, the network should not be 
simply considered as a data transportation tool but rather 
an information processing system. QoS guarantee in this 
type of networks should be measured with respect to quality 
of information instead of just data throughput or bounded 
(end-to-end) packet delay. Although substantial research 
has been done in information processing for speciflc appli- 
cations, a systematic analytical framework for performance 
modeling and evaluation of information-driven networks re- 
mains blank. To the best of our knowledge, this paper is 
the flrst attempt to flU the gap by developing an analytical 
approach for performance evaluation of information-driven 
networks. We proved the basic properties and service guar- 
antee of information-driven network calculus, derived the 
stochastically achievable information delivery rates, and in- 
vestigated the problem of information transmission schedul- 
ing. 

This paper focuses mainly on the development of a new 
analytical framework, within which many interesting prob- 
lems demand further investigation. These problems, for ex- 
ample, include the scheduling problem with a generic net- 
work topology for information fusion, the various optimiza- 
tion problems in communication as well as computation re- 
source allocation, and the performance bounds if informa- 
tion error and lossy models are introduced. 

Acknowledgment 

This work is supported by a fellowship from the Centre for 
Quantiflable Quality of Service in Communication Systems 
at the Norwegian University of Science and Technology, and 
by the Discovery Grant of Natural Sciences and Engineering 
Research Council of Canada. 

11. REFERENCES 

[1] R. Ahlswede, N. Cai, S. Li, and R. Yeung. Network 

information flow. IEEE Trans. Information Theory, 

46(4):1204-1216, 2000. 
[2] R. Ahmed and R. Boutaba. Distributed pattern 

matching: A key to flexible and efficient p2p search. 

IEEE Journal on Selected Areas in Communications, 

25(l):73-83, January 2007. 



[3] G. Armitagc. Quality of Service in IP Networks. 

Macmillian TP, 2000. 
[4] C.-S. Chang. Performance Guarantees in 

Communication Networks. Springer- Verlag, 2000. 
[5] F. Ciucu, A. Burchard, and J. Liebeherr. A network 

service curve approach for the stochastic analysis of 

networks. IEEE Trans. Information Theory, 

52(6):2300-2312, June 2006. 
[6] T. M. Cover and J. A. Thomas. Elements of 

Information Theory, Second Edition. John Wiley & 

Sons, 2006. 

[7] R. L. Cruz. A calculus for network delay, part I: 
network elements in isolation. IEEE Trans. 
Information Theory, 37(1):114-131, Jan. 1991. 

[8] M. Fidler and J. B. Schmitt. On the way to a 

distributed systems calculus: An end-to-end network 
calculus with data scaling. In Proc. ACM 
SIGMETRICS/Performance 2006, pages 287-298, 
2006. 

[9] M. Carey and D. Johnson. Computers and 

Intractability: A Guide to the Theory of 
NP-Completeness. W.H. Freeman and Company, 1979. 

[10] C. Intanagonwiwat, R. Covindan, and D. Estrin. 
Directed diffusion: A scalable and robust 
communication paradigm for sensor networks. In 
Proceedings of ACM MobiCOM '00, pages 56-67, 
Boston, August 2000. 

[11] Y. Jiang. A basic stochastic network calculus. In 

Proceedings of ACM Sigcomm 06, pages 123-134, Pisa, 
Italy, September 2006. 

[12] D. Kundur, Z. Liu, M. Merabti, and H. Yu. Advances 
in peer-to-peer content search. In Proceedings of IEEE 
International Conference on Multimedia and Expo 
(ICME), 2007, pages 404-407, Beijing, China, July 
2007. 

[13] J. Kurose. On computing per-session performance 

bounds in high-speed multi-hop computer networks. In 
ACM SIGMETRICS'92, 1992. 

[14] A. Lakhina, M. Croella, and C. Diot. Mining 
anomalies using traffic feature distributions. In 
Proceedings of ACM SIGCOMM 05, pages 217-228, 
Philadelphia, USA, August 2005. 

[15] A. Lall, V. Sckar, M. Ogihara, J. Xu, and H. Zhang. 
Data streaming algorithms for estimating entropy of 
network traffic. ACM SIGMETRICS Performance 



Evaluation Review, 34(1):145-156, June 2006. 

[16] J. Le Boudec and P. Thiran. Network Calculus: A 
Theory of Deterministic Queuing Systems for the 
Internet. Springer- Verlag, 2001. 

[17] C. Li, A. Burchard, and J. Liebeherr. A network 
calculus with effective bandwidth. IEEE/ACM 
Transactions on Networking, 15(6):1142-1453, 
December 2007. 

[18] C. Liu, K. Wu, and J. Pei. An energy efficient data 
collection framework for wireless sensor networks by 
exploiting spatiotemporal correlation. IEEE 
Transactions on Parallel and Distributed Systems 
(TPDS), 18(7): 1010-1023, July 2007. 

[19] A. Mahmino, J. Lecan, and C. Fraboul. Guaranteed 
packet delays with network coding. In First IEEE 
International Workshop on Wireless Network Coding, 
pages 1-6, San Francisco, June 2008. 

[20] S. Pattem, B. Krishnamachari, and R. Covindan. The 
impact of spatial correlation on routing with 
compression in wireless sensor networks. ACM 
Transactions on Sensor Networks (TOSN), 4(4):article 
24, August 2008. 

[21] J. B. Schmitt, F. A. Zdarsky and L. Thiele. A 

comprehensive worst-case calculus for wireless sensor 
networks with in-network processing. In Proceedings of 
IEEE RTSS 2001, pages 193-202, Tucson, Arizona, 
December 2007. 

[22] C. E. Shannon. A mathematical theory of 

communication. Bell System Technical Journal, 
27:379-423,623-656, 1948. 

[23] R. Szewczyk, J. Polastre, A. Mainwaring, and 

D. Culler. Lessons from a sensor network expedition. 
In Proceedings of the First European Workshop on 
Sensor Networks (EWSN), pages 307-322, Berlin, 
Germany, January 2004. 

[24] M. Vuran, O. B. Akan, and I. Akyildiz. 

Spatio-temporal correlation: Theory and applications 
for wireless sensor networks. Computer Networks, 
45(3):245-259, June 2004. 

[25] O. Yaron and M. Sidi. Performance and stability of 
communication network via robust exponential 
bounds. IEEE/ ACM Trans. Networking, l(3):372-385, 
June 1993. 

[26] R. Yeung, S. Li, N. Cai, and Z. Zhang. Network 
Coding Theory, now Publishers, 2006. 



APPENDIX 

A. PROOFS OF RESULTS 

Proof of Lemma [21 For any random variables X and Y , and 
any X > 0, z > 0, {X - r > x} n {X < X + 2} n {y > ^} = 0, 
where denotes the null set. We thus have 

{X - Y > x) <Z {X > X + z} \J {Y < z}, 

which means 

Prob{X -Y >x} < Prob{X > x + z} + Prob{Y < z}. 

Since the above inequality holds for all z > 0, we get 

Prob{X -Y >x}< inf [Prob{X > x + z} + Prob{Y < z}], 
z>0 

with which and Fx{x) < fix) and Fy{x) < g{x), where / G 
J^, g £ JF, the result is proved. ■ 

Proof of Theorem in (Superposition). Based on the prop- 
erties of information and redundant information, we have 

sup [H{A{u, s)) — (qi (s — Ji) + a2{s — u) — 7(5 — «))] 

0<u<s 

= sup lH{Ai(u,s)) + H(A2{u,s)) - I{Ai;A2)(u,s)- 

0<u<s 

(ai (s — u) + Q2 (s — u) — 7(s — u))] 

< sup [H{Ai{u, s)) — ai(s — u)] + sup [H{A2(u, s)) — a2{s ~ u) 

0<u<s 0<u<s 

- inf [I{Av,A2){u, s) ~ 7(s - u)] 

0<u<s 

for any s > 0, from which, wc further get 

sup sup [H{A{u, s)) — (qi(s — m) + a2{s — u) — ■y{s — u))] 

0<s<t 0<u<s 

< sup sup [H{Ai{u, s)) ~ ai{s — u)] + 

0<s<t 0<u<s 

sup sup [H{A2{u, s)) ~ a2{s — u)] 

0<s<t 0<u<s 

-J'^l n>t [HM;A2){u,s)-'y(s-u)]. 

0<s<t 0<u<s 

From the above inequality, the theorem is proved with the def- 
initions of information stochastic arrival curve and low-bounded 
information stochastic arrival curve, Lemma[T] and Lemma[2] ■ 

Proof of Theorem [2] (Concatenation) . We only prove the 
two node case, because the same result can be extended to the N- 
node case. We use A^{t) and A"(t) to denote the input flow and 
the out flow of node i, respectively. Note that A^''{t) = A^{t). 
For any s > and a; > 0, we have 

HiA\s)) ® [/3I ® fS^ris) - H(A2*(s)) 
<H{A\s)) <g> ([/3I]- (B> [P^Dis) - H{A^*is)) 
= inf [H{A^u))(X,[l3^]^u) + [l3^]'^{s-u)-H{A^*{u)) 

0<u<s 

+ H{A\u))]-HiA^*{s)) 
< sup [H{A\u))(X,[l3^r{u)~ HiA'-'-iu))] 

0<u<s 

+ inf lH{A^{u)) + [p^ns - u)] - H{A^*is)) 

0<u<s 

= sup [//(Ai(M))(g)[/3i]-(M)-//(Ai*(«))] 

0<ii<s 

+ H(A2(s)) tg) [/32]-(s) - HiA^*{s)). 



We thus have for any t > 0, 

sup [H{A\s)) (g, [/3I ^/32]-(s) - H{A^*is))] 

0<3<t 

< sup [H{A\u))(X,[l3'^r{u)- HiA'^iu))] 

0<u<t 

+ sup [H{A^{s)) ® [P^r(s) - HiA*is))]. 

0<a<t 



The theorem is proved with the above inequality, the definition 
of information stochastic service curve, and Lemma [T] ■ 

Proof of Theorem [3] (Output). Based on the properties of 
information, for any < s < t, we have 

H{A*{s,t)) = H{A*{t))-H{A*{s)) 
<HiAit))-HiA'is)) 

=H{A{t)) - H{A{s)) ® Wis) + H(A(s)) ® [I3f{s) - H(A*(s)) 
= sup [H{A{t))-H{A{u))-[pr{s-u)] 

0<u<s 

+ [H(A(s))^[f3ns)-HiA'is))] 
= sup lH{A{u,t))~a{t-u) + a{t~u)-ll3f{s-u)] 

0<u<a 

+ [H(A(s))(g[l3ns)-H (A' is))] 

< sup [H{A{u,t)) - a{t - u)] + a0 I3{t - s) 

0<n<t 

+ [H(A(s))(g[f3ns)^HiA'is))] 
We thus have 

sup lH{A''{s,t)) - a0 /3{t ~ s)] 

0<s<t 

< sup lHiAiu,t)) - a(t-u)]+ sup [H{A{s)) (g [/3f (s) - H{A* (s))] 

0<u<t 0<s<t 

from which together with simple manipulation, we further get 
sup sup [H{A*{u,s)) - a0 /3{s - u)] 

0<a<t 0<u<a 

< sup sup [H{A{u, s)) ~ a{s — u)] 

0<a<t 0<u<a 

+ sup lH(A(s))g,m-is)-HiA'(s))]. 

0<s<t 

From the above inequality, the theorem is proved with the def- 
inition of information stochastic arrival curve, the definition of 
information stochastic service curve, and Lemma [T] ■ 

Proof of Theorem [4] (Service guarantee). 1) For the in- 
formation backlog B{t), wc have for any t,x > 0, 

B(t)=H(A{t))-H(A'(t)) 
=H{A{t)) - H(A(t)) » [/3]-(t) + H(A{t)) ® Wit) - HiA'it)) 
= sup [H{A{s, t)) - a{t -s) + a{t - s) - W{t - s)] 

0<B<t 

+ [H{A{t))g>[(3nt)-H{A'(t))] 

< sup [H{A{s,t)) - a{t ~ s)] + a0 I3{0) 
0<s<t 

+ [H{A{t))gi[(Snt)-HiA*m 

< sup [H(A(s,t)) - a{t - s)] + a(d 13(0) 
0<s<t 

+ sup lHiAis))»lf3ris)-H(A*(s))] 

0<a<t 

The result is proved with the above inequality, the definition of 
information stochastic arrival curve, the definition of information 
stochastic service curve, and Lemma [T] 

2) For the information delay D{t), we have from the definition, 
for any y > 0, {D{t) > y} C {H{A{t)) > H{A*{t + y))} and 
hence Prob{D{t) > y} < Prob{H{A{t)) > H{A*{t + y))}. We 
also have: 

H(A(t))-H{A*(t + y)) 
=H(A(t)) - H{A{t + y)) ® m^{t + y) + H{A(t + y)) ® [f5f {t + y) 
+ a(t - s) - a{t - s)~ H{A*{t + y)) 

< sup [H(A(s,t)) - a{t - s)] 

0<a<t 



+ H(Ait + y))i 



%t + y)-HiA''it + y)) 



+ sup lait-s)-W(t-s + y)] 

0<a<t + y 



By replacing y with h[a+x, [/3]^) in above, where h{a + x, [/9]^) = 
supjj^Q{inf {t > : 0-{s) + x < [/3]^(s + t)}} is the maximum 
horizontal distance between functions «(t) + 3; and [0\^{t), which 
implies a{t) + x < [PY{t + h{a + x, [0\^)), we obtain 

H(A(t)) - H(A* (t + h(a + X, 

< sup [H(A(s,t)) - a(t - s)] 

0<s<t 

+ H{A{t + h{a + X, [PD)) (g) [fSnt + h{a + x, 
- H{A*{t + h{a + x,[l3r))) -X 

< sup [_ff(A(s,t)) - Q!(t - S)] + 
0<s<t 

sup [H{A{s + h{a + X, [/3]"))) (g) + h{a + x, [/3]")) 

0<s<t 

-H{A*{s + h{a + x,[l3r)))]-x 

Based on the above inequality, the definition of information stochas- 
tic arrival curve, the definition of information stochastic service 
curve, and Lemma [T] we have Prob{D{t)} > h{a + x, [/3]^)} < 
f'»g{x). 

3) For the information backlog within delay bound t(< D{t)), 
using the same derivation as in (2), we have 

B{t,T) = H{A{t)) - H{A* {t + t)) 
< sup [H{A{s,t)) - a(t - s)] 

0<a<t 

+ H{A{t + r)) (g) [f3r{t + r) - H{A*{t + r)) 
+ sup [a(t-s)-[/3]^(t-s + T)] 

0<S<t + T 

= sup [_ff(A(s,i)) - Q(t - s)] 

0<s<t 



+ H(A(t + T))i 



'{t + r)- H(A*(t + T)) 



- inmr{v)-a{v^T)] 

i;>0 

< sup [H{A{s,t)) - a(t ~ s)\ 

0<a<t 

+ sup [/f(yl(s + r))(g)[/3]^(s + T)-i/(A*(s + T))] 
0<s<t 

— inf [/3(d) — 0(11 — r)] 

The result is proved with the above inequality, the definition of 
information stochastic arrival curve, the definition of information 
stochastic service curve, and Lemma [!]■ 

Proof of Theorem \5\ (Service reduction with impair- 
ment). We could treat the system as if it provides service to 
the "aggregate" of two fiows: the input A{t) and the impairment 
l{t). Denote the "aggregate" as A(t). There holds H{A{t)) = 
H(A{t)) + H{i{t)) and H{A*{t)) = //(yl*(i)) + H{X*(t)), where 
A''{t),A*(t), and i*(t) are the outputs of A(t),A{t), and i{t), 
respectively. We have for any > 0, 

H{A{s)) (g) [/3-a]-(s) -H(A*(s)) 

< inf [H(A(u))- H(i{u)) + [l3Y'{s-u)-a{s-u)] 

0<u<s 

~H{A*{s)) + H{I'{s)) 

< [H(i(s)) ® ll3r(s) ~ HiA'is))] - inf [/f(j(«)) + a(s - u)] 

0<u<s 

+ H(I'(s)) 

< IHiAis)) ® ll3ris) - /f(i*(s))] - inf mxiu)) + a{s - u)\ 

0<u<s 

+ H(iis)) 

= [H{A{s))^[l3]'^{s)-H(A*{s))]+ sup [H{i{u, s)) - a{s - u)] 

0<u<a 

We hence have for any t < 0, 

sup [/f(A(s))g)[/3-a]-(s)-i/(A*(s))] 

0<s<t 

< sup IHiAis)) (g l(3ris) - H(A* is))] 

0<3<t 

+ sup sup [Hil{u, s)) — a{s — u)]. 

0<s<t 0<u<s 



The theorem is proved because the system provides A{t) with 
service ^i.s.s.< g^P > and the impairment process I follows 
~i.a.a.< f,a >.m 

Proof of Theorem [6] (Parallel servers). We only prove 
the case of two parallel nodes, because the result can be easily 
extended to the case of A'^ parallel nodes. Because sub-fiows are 
information exclusive, we have i?(A(t)) = _f/(Ai(t)) + H{A2it)). 
Let Pit) = P'^it) + /3^it). Because the weight assigned to sub- 
fiow Ai in the weighted information splitter is for any s > 0, 

H(A(s)) - = H{A{s)) - TW = wJiere 

Ai(s) + A2(s) = 1. Therefore, for any s > 0, we have: (Due to 
space limitation, in the following derivation, we simply use j3, f}^, 
(fi to respectivaly represent [/3]^, [P^Y ^-nd [/3^]^.) 

HiAis)) ® Pis) -HiA* is)) 
= HiAis)) (g Pis) - HiAiis)) g) p\s) - HiAiis)) (g, p'^is) 
+ [HiAiis))(g,p\s)-HiAlis))] 

+ [HiA2is))<^p\s)-HiAlis))]. (26) 
Looking at the first three items, we have 

HiAis))® Pis) - HiAiis))(S>PHs) - HiAiis)) (g)P^ is) 
= HiAis)) Pis) - (Ai(s)H(A(s))) g) iXiis)Pis)) 

-(A2(s)/f(A(s)))®(A2(s)/3(s)) 
= inf [HiAiu)) + Pis ~ u)] 

0<u<s 

- inf [\i iu) H i Aiu)) + \i is -u)P is -u)] 

0<u<s 

- inf lX2iu)HiAiu)) + X2is ^ u)Pis ~ u)] (27) 

0<u<s 

Assume that inf o<u<a [Ai (M)/i'(A('ii)) + Ai(s — u)Pis — u)] = 
Ai(ni)H(A(«i)) + Ai(s-ui)/3(s-ui), info<„<s[A2(«)//(A(u)) + 
\2is-u)Pis-u)\ = X2iu2)HiAiu2)) + \2(s-U2)Pis-U2). With- 
out loss of generality, assume that 0<mi<M2<s. 

Case 1: If during the period (jii,U2], -ff(A(ui,U2)) > /3(«i — 
M2), Equation H27| l is 

<H(A(Mi)) + Pis - «i) - Ai(«i)H(A(«i)) - Ai(s - «i)/3(s - Ml) 

- A2(«2)-H'(A(m2)) - A2(s - U2)Pis - U2) 
=A2(«2 - «i)[/3(«2 - "1) - //(A(«i,-U2))] < 

Case 2: If during the period (Mi,ii2], -ff (A(«i , 112)) < Piui — 
U2), Equation H27| l is 

<HiAiu2)) + Pis - U2) - Ai(«i)H(A(Mi)) - Ai(s - «i)/3(s - uj) 

- >'2iu2)HiAiu2)) - A2(s - U2)Pis - U2) 

=Xiiu2 - ui)[HiAiui,U2)) - Piu2 - ui)] < 

Therefore, Equation II27I I is no larger than in any case. For 
Vt > 0, Equation ||26|I thus becomes: 

sup [HiAis)) ® (/3I + p^)is) - HiA*is))] 

0<s<t 

< sup [H(Ai(s))®/3i(s)-H(A*(s))] 

0<s<t 

+ sup [HiAiis)) (g> p'^is) - HiA*2is))] 

0<s<t 

The theorem is hence proved with the above inequality, the defi- 
nition of information stochastic service curve, and Lemma [T] ■ 

Proof of Theorem \7\ We have 

HiAit)) = HiAiit)) + HiAiit)) - li^iit); ^2it)), 

and 

HiA'-it + T))=HiAlit + r)) + HiA*2 it + T)) 
-IiAlit + r);Alit + r)). 

The theorem is obvious because _f/(A(t)) = H(A*(t+T)), _ff(A5;(t-f 
r)) = HiAiit)), and /f(A^(t + r)) = H(A2{t)).B 



